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Abstract 

Avian coronaviruses, including 
infectious bronchitis virus (IBV), аге 
important respiratory pathogens of poultry. 
The heavily glycosylated IBV spike protein is 
responsible for binding to host tissues. 
Glycosylation sites in the spike protein are 
highly conserved across viral genotypes, 
suggesting an important role for this 
modification in the virus life cycle. Here, we 
analyzed the N-glycosylation of the receptor- 
binding domain (RBD) of IBV strain M41 
spike protein and assessed the role of this 
modification in host receptor binding. Ten 
single Asn-to-Ala substitutions at Ше 
predicted N-glycosylation sites of the M41- 
RBD were evaluated along with two control 
Val-to-Ala substitutions. CD analysis revealed 
that the secondary structure of all variants was 
retained compared with the unmodified M41- 
RBD construct. Six of the ten glycosylation 
variants lost binding to chicken trachea tissue 
and ап ELISA-presented 0o2,3-linked sialic 
acid oligosaccharide ligand. LC/MSE 
glycomics analysis revealed that glycosylation 
sites have specific proportions of N-glycan 
subtypes. Overall glycosylation patterns of 


most variant RBDs were highly similar to 
those of the unmodified M41-RBD construct. 
ш silico docking experiments with the 
recently published cryo-EM structure of the 
M41 IBV spike protein and our glycosylation 
results revealed a potential ligand receptor site 
that is ringed by four glycosylation sites that 
dramatically impact ligand binding. Combined 
with the results of previous array studies, the 
glycosylation and mutational analyses 
presented here suggest a unique glycosylation- 
dependent binding modality for the M41 spike 
protein. 


Avian coronaviruses of poultry cause 
significant disease with subsequent economic 
losses in several commercially farmed bird 
species. Avian Infectious Bronchitis Virus 
(IBV) is a  Gammacoronavirus that 
predominantly affects domestic fowl, 
primarily chickens (Gallus gallus). The virus 
initially infects upper airway epithelium 
tissues and, depending on the IBV strain, 
disease outcomes range from mild respiratory 
disease to kidney failure and death (1). 


6107 ‘STZ PEN uo 1son3 Aq /310'0qf mmm: dyy шош popeo[uMo(q 


The viral envelope of IBV contains the 
highly glycosylated spike (S) protein which is 
post-translationally cleaved into two domains: 
51 and S2. This S glycoprotein is the major 
adhesion molecule of the virus. It is a class I 
viral fusion protein, in which the variable 51 
domain is involved in host cell receptor 
binding and the more conserved S2 domain 
mediates the fusion of the virion with the 
cellular membrane (2,3). The role of spike in 
host cell attachment and the induction of 
protective immunity has been reviewed (4). 
The spike protein monomer is а 
transmembrane glycoprotein with a molecular 
mass of 128 kDa before glycosylation (3). A 
cleavable N-terminal signal peptide (5) directs 
the S protein toward the endoplasmic 
reticulum (ER), where it is extensively 
modified with N-linked glycosylation (6,7). 
After glycosylation in the ER, the monomers 
oligomerize to form trimers (6-9). 

The N-terminal 253 amino acids of 51 
were shown to encompass the receptor- 
binding domain (RBD) of IBV strain M41 
(10), which interacts with — sialyl-02,3- 
substituted glycans present on the host's cell 
surface (11,12). Ten N-linked glycosylation 
sites are predicted to exist on the M41-RBD 
(5), of which most are highly conserved 
(Supplemental Fig. S-1). It is interesting that 
eight of the ten sites are 95-100% conserved. 
Sites N33 and N59 were less conserved at 
80% and 25%. However, each had a nearby 
alternative site which was also highly 
conserved. Alternate site N36 was conserved 
50% of the time and one or both N33 and N36 
was present in 94% of the sequences. Site 
N57 was conserved at 73%. In 97% of the 
sequences either N59 or N57 was present but 
never together. Therefore, all ten sites 
including the alternatives likely serve 
important functions 

N-glycosylation of viral glycoproteins 
is known to modulate the ability of viruses to 
infect host cells and to be recognized by the 
host's immune system (13). Recently, Zheng 
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et al. (2018) studied extracted spike proteins 
and mutant viruses with (asparagine to 
aspartate)  N-to-D апа (asparagine to 
glutamine) N-to-Q mutations at 13 predicted 
glycosylation sites in the S protein of the 
Beaudette IBV strain (14). Their results 
indicate that glycosylation at some sites on the 
Beaudette S1-RBD was important for viral 
fusion and infectivity which may include host 
recognition. However, the Beaudette strain is 
a cell culture adapted strain, is non-virulent in 
chickens (15) and does not bind chicken 
tissues known to be important for infectivity 
(11), making it difficult to extrapolate these 
results to clinically relevant IBVs. 

To characterize and assess the role that 
glycosylation plays when interacting with host 
tissues through the RBD of pathogenic IBV 
strain M41, we used a combination of 
molecular and analytical techniques including 
histochemistry, ELISA, circular dichroism 
(CD), mass spectrometry, and docking 
analysis as listed in Table 1. Systematic 
deletion of each glycosylation site and 
histochemical analysis of each variant 
revealed which of the ten glycosylation sites 
affect the binding of IBV S-protein to host 
epithelial tissue. Site occupancy analysis by 
LC/MS* indicated that at least 9 out of 10 
predicted N-glycosylation sites in the М41- 
RBD domain are glycosylated. Analysis of 
site occupancy апа signature N-glycan 
patterns at each site in combination with 
single glycosylation site deletions provided 
insight toward the biological relevance of each 
of those sites in binding to host tissue 
receptors. Overall, our data confirms that N- 
glycosylation plays a critical and likely unique 
role in binding of the IBV spike domain to its 
host tissue receptors. 


RESULTS 

Gel electrophoresis and CD analysis indicate 
that M41-RBD and glycosylation variants are 
similarly expressed, folded and stable. To 
analyze the role of glycosylation of M41-RBD 
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in receptor binding, missense mutants (N-to- 
A) were generated on a site-by-site basis at 
each of the predicted N-glycosylation sites. 
Recombinantly produced glyco-variant RBD 
proteins migrated with Ше same 
electrophoretic mobility as unmodified M41- 
RBD (Fig. 1). The RBD proteins were 
evaluated by circular dichroism (CD) 
spectroscopy to assess similarity to the wild 
type secondary structure. Wild type M41- 
RBD, all ten glycosylation-site variants, and 
two non-glycosylation variants, V57A and 
V58A, were analyzed for secondary structure 
differences at 25 °C. Thermal melts were 
performed on each construct from 25 °C to 95 
°C followed by full scans collected at 95 °C 
and again at 25 °C after the melt. Overlays of 
all the CD spectra can be found in 
Supplementary Fig. S-2. Visually, all spectra 
at all temperatures follow the same curve. The 
N85A spectra were generated at higher protein 
concentration but aligned well to CD spectra 
of all other variants when normalized to the 
percent of maximum signal. Likewise, all the 
proteins had analogous broad melting curves 
suggesting the proteins were similarly stable. 
Protein folding was reversible for all proteins, 
with comparable recovery rates (see CD- 
25°C-aftermelt-normalized in Supplementary 
Fig. 5-2). Dichroweb (16) was used to 
calculate the percent of a-helix, B-strand, turn, 
and unordered portions of the protein in the 
initial 25 °C spectra to estimate secondary 
structure differences between the proteins 
(Fig. 2). The percent of a-helix varied with the 
extremes being unmodified RBD and N145A. 
М145А exhibited 19.5 + 0.3% a-helix 
character as compared to wild type which has 
31.6 + 2.4%. Interestingly, М145А gave a 
very strong signal in the histochemical assay 
(Fig. 3A) and had the most notably different 
released glycans signature compared to the 
other constructs. We conclude that all proteins 
maintained a very similar structure and 
therefore suggest that single N-glycosylation 
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sites are by themselves not indispensable for 
protein folding or stability. 


Six out of ten glycosylation variants abrogate 
binding to host tissue and sialic acid. Since 
we established that all variant M41-RBD 
proteins are folded, we investigated their 
abilities to bind tissue receptors. Recombinant 
proteins were incubated with chicken trachea 
tissue sections and examined by histochemical 
analysis. N145A, N219A, N229A, N246A, 
V57A, and V58A bound ciliated epithelial 
cells of the chicken trachea with similar 
staining intensity as the unmodified RBD with 
the most intense staining associated with the 
N145A construct (Fig. 3A). In contrast, 
binding of constructs N33A, N59A, N85A, 
N126A, N160A, and N194A to trachea tissue 
was not detectible. Removal of sialic acids by 
treatment of the trachea tissues with 


Arthrobacter | ureafaciens ^ neuraminidase 
(AUNA) abrogated binding of all constructs as 
shown in Figure 5-4. These results 


demonstrate that glycosylation on the RBD 
affects binding to sialyl ligands on chicken 
trachea tissue. 

The interaction of the variants with 
Neu5Ac(a2-3)Gal(B1-3)GIcNAc, a previously 
established ligand for M41 (11), was assayed 
by ELISA. М145А, N219A, N229A, and 
N246A variants were able to bind the ligand in 
a concentration-dependent manner (Fig. 3B) 
like unmodified RBD. Binding affinities of 
N33A, N59A, N85A, N126A, NI60A, and 
N194A were significantly reduced compared 
to unmodified RBD and comparable to that of 
a negative control protein, the S1 of turkey 
coronavirus TCoV, with specificity for non- 
sialylated diLacNAc glycans (17). Fig. 3C 
shows the ELISA absorbance at the 75 nmol 
ligand concentration for each construct. No 
significant difference was observed for 
variants N145A, N219A, N229A and N246A 
compared to unmodified RBD (shown in dark 
grey bars). All other variants (shown in light 
grey bars), demonstrated significantly lower 
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affinity for the receptor, consistent with 
histochemistry and ligand titration plot results. 


Overall glycosylation of non-binding variants 
is similar to M41-RBD. Six of the ten single 
glycosylation site variants lost the ability to 
bind ligand. To investigate if global changes 
in glycosylation may have affected binding we 
analyzed release glycans from each protein. 
Matrix Assisted Laser Desorption/Ionization — 
Time of Flight (MALDI-TOF) mass 
spectrometry (MS) analysis of enzymatically 
released and permethylated glycans allows for 
semi-quantitative analysis of X glycan 
compositions. The method 15 particularly 
useful for samples containing sialylated 
glycans since they аге stabilized by 
permethylation. The percent abundances of 
glycans identified in each sample are shown in 
Fig. 4. 

The majority of the N-to-A variants, as 
well as the V57A and V58A control variants, 
had similar MALDI-TOF-MS permethylation 
profiles (Fig. 4). Over 100 glycan 
compositions were identified ranging from 
high mannose glycans to large complex ones. 
Nearly half of the glycans contained at least 
one and up to three sialic acid molecules in all 
samples. The most intense glycoforms 
clustered in five groups with increasing 
amounts of complexity as reflected by the 
number of N-acetyl glucosamines 
(HexNAc’s). These included high mannose, 
complex and hybrid forms as follows: I. Hexs- 
9HexNAcz (high mannose); II. NeuAco.1Hexs- 
6dHexo-1HexNAc3 (complex and hybrid); Ш. 
NeuAcooHexsdHexiHexNAc4(complex); IV. 
NeuAco.1HexedHexiHexNAcs (complex) and; 
У. NeuAc2Hex7dHexiHexNAce (complex). 
High mannose glycans were less abundant in 
unmodified M41 than in variant RBDs. The 
М194А, N219A, and N229A variants 
contained diminished amounts of the group V 
high mass complex glycans. The М145А 
variant was the most atypical with less defined 
clustering in the common clustering regions of 
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the spectrum and higher abundances in 
spectral regions where compositions had less 
Hex and more HexNAc overall. For instance, 
cluster IV was shifted from glycans with 6 
hexoses (NeuAcoiHexedHexiHexNAcs) to 
glycoforms with 3 to 4 hexoses (NeuAco. 
1Hex3-4dHex;HexNAcs). More abundance was 
observed in regions containing 6 HexNAc 
residues (NeuAco2Hexs.edHexiHexNAce). To 
better understand the difference between 
М145А and the other constructs, we calculated 
the monosaccharide percent mass and average 
mass for each construct. The average mass 
percent for glycans across all released glycan 
pools was Hex (45.8%), HexNAc (42.0%), 
dHex (5.096) and NeuAc (7.296). The NI45A 
construct had the lowest amount of Hex 
(38.6%) and the highest amounts of HexNAc 
(46.0%) and NeuAc (9.8%). The former two 
were 2 standard deviations (SD) or greater 
from the mean (see supplemental Table S-2). 
This indicates that the N145A construct likely 
had shorter, more branched and more highly 
charged glycans on average than the other 
constructs. Two other variants had values 
more than 2 SD from the mean. N229A 
(normal binding) was most abundant in Hex 
(53.6%), and least abundant in HexNAc 
(37.5%) and dHex (3.896) probably due to its 
higher high mannose content. N246A (normal 
binding) had the lowest amount of NeuAc 
(3.6%). This is perhaps a reflection of the 
missing sugars in this variant since site N246 
in other variants was populated with many 
sialylated glycoforms based on site-specific 
analysis (Supplementary Table S-1). 


Glycosylation and site occupancy were 
similar between M4I-RBD, М59А, апа 
NI45A. 

To assess differences in glycosylation on a 
site-to-site basis, glycopeptide LC/MS 
analysis was carried out on unmodified M41 
and two single glycosylation site variants, 
N59A and М145А that represented a non- 
binder and a binder of trachea tissue, 
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respectively. M41-RBD had ten predicted 
glycosylation sites while the variant RBDs had 
nine each. N145A was also of specific interest 
due to the unique glycosylation pattern 
observed in its free glycan profile. As 
cleavage with trypsin alone resulted in 
glycopeptides with more Шап one 
glycosylation site, we also analyzed 
glycopeptides after an additional treatment 
with chymotrypsin, which resulted in one 
glycosite per peptide, the identification of 
more glycopeptides, and decreased ambiguity 
concerning glycosylation site assignment. 
Although a protein may contain the 
sequence (N-X-S/T) where N-glycosylation is 
known to occur, it may not actually be 
glycosylated, or may be glycosylated only part 
of the time. Potential glycosylation sites, their 
predicted glycosylation state, and their 
measured site occupancy are shown in Table 
2. Of the ten glycosites, all but N246 were 
predicted to be glycosylated (occupied) based 
on NetNGlyc analysis 
(http://www.cbs.dtu.dk/services/NetNGlyc- 
1.0/). Percent occupancy was analyzed by 
LC/MS, however a poor signal was obtained 
for the N219 site in M41 and N59A and 
therefore occupancies were not calculated. 
All other sites were estimated to be occupied 
at 89% or greater in M41 and N59A. The 
N145A variant exhibited site occupancy at all 
expected sites, including N219, although 
signal intensity at that site was low. Two sites 
had much lower occupancy in М145А as 
compared to the other samples. Site N126 
dropped to 6196 occupancy and site N246 to 
79% occupancy compared to nearly complete 
occupancy in the N59A and M41 proteins. 
Overall site occupancy was high for all sites. 
The difficulty in detecting some of the 
peptides, particularly N219, may be due to 
hydrophobicity. Ionization is partially driven 
by hydrophobicity and N219 only had 20% 
hydrophobic character after the 2 digestions 
which may, in рап, explain its low 
detectability. By comparison, glycopeptides 
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containing N85, N145 and М160 were short 
and between 21 and 33% hydrophobicity 
while glycopeptides containing other sites had 
predicted hydrophobicity ranging from 37 to 
6196 and tended to produce higher intensity 
spectra. 

Glycoform relative abundances at each 
site are listed in Supplementary Table S-1. 
Fig. 5 shows the location of each 
glycosylation site on the RBD of M41. Overall 
compositions at each site were similar in 
charge and size across the three constructs. A 
representative glycan is shown at each site 
based on peak intensity. The N145A construct 
had glycoforms like those identified by 
MALDI-ToF MS with more HexNAc and 
fewer Hex compared to M41 and N59A. 

Fewer overall glycan compositions 
were detected on glycopeptides by LC/MS 
compared to the free glycans observed by 
MALDI-TOF MS (63 versus 100 
compositions). This can be expected since the 
technology of instrumentation used and the 
physiochemical characteristics of 
permethylated glycans and  glycopeptides 
differ significantly. The forms detected 
overlapped between the two analyses. 


Docking results are dependent on 
glycosylation status of the M41-RBD protein. 
During the course of our investigation, the 
first structure of the M41 spike protein was 
solved using electron microscopy (EM) (18). 
Mapping the glycosylation sites onto the 
structure did not lead to a clear understanding 
of how the mutations affect binding. While 
EM structural resolution is limited, and the 
precise coordinates for the attached glycans 
are not known, an attempt was made to dock a 
series of potential sialylated ligands to a 
glycan-stripped structure of the RBD and a 
structure that was populated with glycans 
based on our data. The glycan chosen for each 
site on the RBD was based on the predominant 
glycans identified at each site by LC/MS (see 
Fig. 5). 
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Seventeen oligosaccharide ligands 
were chosen based on a previous glycan array 
study of M41 (11) and ELISA data (this 
work). Both strong and weak binders were 
selected (Fig. 6). Each ligand was docked 20 
times against both the sugar-stripped and in 
silico glycosylated M41-RBD coordinates. 
There was no statistical difference between the 
docked binding energies of ligands that did 
and did not bind on the array. All 
oligosaccharide ligands, except for 1, 3, 9, 13, 
15 and 17 docked seven or more times to one 
or more of four sites on the M41 sugar- 
stripped structure with no clear pattern 
differentiating between them (Fig. 6). In the 
sugar-stripped structure, all binding occurred 
at Sites A and B. Site A is under the galectin 
fold near site N194, and Site B encompasses 
N85 and М59. All three glycosylation sites are 
required for binding to trachea tissue. The 
docking pattern changed dramatically when 
glycans were modeled onto the structure. The 
most dramatic change was seen at Site D 
where 8 ligands bound 7 or more times whilst 
interactions at all other sites decreased. There 
were no binders at Site A, only 2 at C (3 and 
16) and 3 at Site B (6, 9 and 17). All of the 
ligand oligosaccharides that docked at Site D 
were sialylated, consistent with ligands 
identified by array and ELISA. No control 
ligand (1 and 2 uncharged; 3 and 4 KDN 
charged) bound at Site D. The interaction at 
Site D involved both sugar-protein and sugar- 
sugar contacts and, in some docking runs, the 
interaction was completely sugar-sugar. Site D 
is in the center of a circle of glycosylation 
sites that showed altered binding profiles 
when mutated; N59A, N85A, N160A lost the 
ability to bind, while N145A gave a very 
strong signal in the histochemical assay. 

Of note, no ligands docked in the site 
at the top of the galectin fold where many 
structural homologs of M41 are thought to 
bind sugars, such as the bovine coronavirus 
RBD (19). For comparison, we docked 
Neu5Ac(a2-6)Gal(B1-3)GIcNAc(B-OMe) 
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against the crystal structure of the bovine 
RBD. Twenty-five out of twenty-five times 
the glycan docked in the proposed binding site 
at the top of the galectin fold in the negatively 
charged area of the bovine RBD control near 
N198 (Fig. 7B). 


Discussion 
Previously we established that the IBV M41 
Sl protein binds sialic acid substituted 
glycoconjugate ligands in chicken trachea and 
lung tissue (11). Intriguingly, the M41 КВР is 
highly glycosylated with 10 potential 
glycosylation sites and glycosylation appears 
to be necessary for binding to host tissues 
since treating the protein with a neuraminidase 
diminishes binding (11). The present study 
extends our investigation toward determining 
the role of glycosylation in the function of the 
RBD, which encompasses the N-terminal 
region of the native protein. Each of the 
potential glycosylation sites was individually 
ablated and each construct was examined for 
its ability to bind tissue and an ELISA- 
presented ligand. In addition, the global 
glycosylation profile of every construct was 
surveyed апа glycosylation of three 
representative constructs was examined on a 
site-specific basis. 

Six of the ten glycosylation sites in the 
RBD domain of IBV M41 were essential for 
binding to chicken trachea tissue and an 
ELISA-presented sialylated oligosaccharide 
ligand. CD analysis demonstrated that both 
secondary structure and stability were similar 
across all the RBD constructs indicating the 
proper fold was likely retained for all. 
Globally, percent abundances of sialylated 
glycans differed across mutants but the 
differences were not associated with loss of 
binding. For example, 5196 and 2096 of the 
glycans in binding mutants М145А апа 
N246A, respectively and 46% and 51% of the 
glycans in the non-binders N126A and N160A 
respectively were sialylated (summed from 
Fig. 4). By comparison, 40% of the glycans in 
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the unmodified КВО construct were 
sialylated. On a site-specific basis, some 
glycosylation sites had more sialylation than 
others (Supplementary table S-1). On 
average, each of glycosites N126, N194, N229 
and N246 were sialylated at least 5096 of the 
time. Sites N229 and N246 were in the less 
ordered region of the protein away from the 
galectin fold where binding is associated in 
the docking study. Site N194 is at the bottom 
of the galectin fold and is required for ligand 
binding. Site N126 is at the top of the galectin 
fold and is also required for binding. While 
we cannot conclude that sialylation is required 
at N194 and N126 it is clear that glycosylation 
at these sites serves a role in ligand binding. 

The publication of the cryo-EM 
structure of M41 (18), the first structure of a 
spike protein from a gammacoronavirus, made 
it possible to visualize the distribution of the 
glycosylation sites in the tertiary structure of 
the protein. The study verified the site 
occupancy we observed on M41-RBD since 9 
out of 10 of the glycosylation sites in the EM 
structure were occupied. Site N246, not 
occupied in the EM structure, is on a B-strand 
in the EM structure, and forms close contacts 
with the 51 C-terminal domain in the native 
protein. The C-terminal domain was not part 
of our construct. Therefore, N246 in the 
recombinant constructs was likely in an 
environment much different than that found in 
the full-length protein. 

Many human galectins, and also the 
bovine В-согопауігиѕ spike protein (18), bind 
sugars at what is the top of the B-sandwich 
near site N126 in the RBD constructs (see Fig. 
5). The bovine RBD site N198 closely aligns 
with site N126 of M41 (see Fig. 7). In the 
bovine protein, this demarks the region of 
proposed ligand binding. Loss of N126 in the 
M41 RBD abrogates binding to trachea tissue. 
While ablation of N126 diminishes ligand 
binding, our docking study gave no evidence 
that this is the sialyl ligand binding site in 
M41. Evaluation of the charge distribution in 
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the proposed binding sites indicates that the 
bovine site is negatively charged, whereas 
negative charge in the same region in M41 is 
sparse (Fig. 7). This difference in charge near 
N126 may explain the lack of ligand docking 
in this region (gray p stands in Fig 6B) during 
docking simulations. 

The precise ligand binding region of 
proteins with a galectin fold varies. Rotavirus 
protein VP4, for example, binds sialic acid in 
a groove between the p-sheets of the sandwich 
(20). The clustering of five of six required N- 
glycosylation sites suggests the location of the 
ligand binding site may be on the right of the 
galectin fold as shown in Fig. 5. Our docking 
experiments studying seventeen possible 
oligosaccharide ligands to M41 were not 
conclusive in terms of binding energies but 
did identify four potential saccharide binding 
regions (Fig. 6). Docking also demonstrated 
that glycosylation affects binding in silico 
since one potential site (A; see Fig. 6) lost 
favor while another one, Site D, dramatically 
gained favor when Ше protein was 
glycosylated. Site D is in the center of three 
glycosylated asparagines required for binding 
(N59, N85, and N160) and one whose loss 
results in a very strong histochemical signal 
and has a protein-wide effect on glycosylation 
with increased sialylation (N145). In addition, 
the Site D region is negatively charged (see 
Fig. 7A) like the proposed sialyl ligand 
binding site on the bovine protein (Fig. 7B) 
(19). АП the ligands that interacted with Site 
D were sialylated and included the glycan that 
bound in our ELISA studies. Interestingly, 
carbohydrate-carbohydrate contacts were 
detected in the RBD — ligand interactions at 
site D. This is an intriguing result since 
carbohydrate-carbohydrate interactions, while 
not common, have been reported between 
non-fucosylated antibodies and their receptor, 
in cell-cell adhesion interactions, between 
tumor antigens, апа between bacterial 
receptors and mucin (21-25). А literature 
search did not uncover any reported 


6107 ‘STZ PEN uo 1son3 Aq /бо од ^ мл: шош popeo[uMo(q 


carbohydrate-carbohydrate interactions 
between virus and host. While our docking 
study must be evaluated in the context of the 
higher RMSDs typical of EM structures, and 
the inexactness of modeled oligosaccharides, 
results suggest that a combination of 
carbohydrate-carbohydrate and carbohydrate- 
protein interactions should be considered in 
the binding mechanism. 

In conclusion, we have shown that 
glycosylation of six out of ten sites on the 
M41 IBV RBD аге necessary for the 
interaction of M41 with both trachea tissue 
and Neu5Ac(a2-3)Gal(B1-3)GIcNAc ligand in 
ELISA. Based on occupancy data, at least 
nine out of ten sites were glycosylated in the 
recombinant | MAI-RBD. Deletion of 
individual glycosylation sites had little effect 
on secondary structure but did have some 
effect on overall glycosylation profiles of 
some variants, especially N145A. Some 
differences can be expected since one site, 
with specific glycans, is lost from each 
variant, thus mildly altering overall profiles. In 
silico docking suggests that glycosylation may 
guide ligand binding. Especially intriguing is 
Site D where glycosylation is required for in 
silico docking at that site. The interaction of 
M41 IBV with sialyl ligand may prove to be a 
unique interaction involving both 
carbohydrates and protein. Further 
investigation is warranted. 


Experimental Procedures: 

Ethical Statement. 'The tissues used for this 
study were obtained from the tissue archive of 
the Veterinary Pathologic Diagnostic Center 
(Department of Pathobiology, Faculty of 
Veterinary Medicine, Utrecht University, The 
Netherlands). This archive is composed of 
paraffin blocks with tissues maintained for 
diagnostic purposes; no permission of the 
Committee on the Ethics of Animal 
Experiment is required. 
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Plasmid construction. The pCD5 vector 
containing IBV M41-RBD in frame with a C- 
terminal GCN4 trimerization motif and Strep- 
Tag has been described previously (10). Site- 
directed mutagenesis using the Q5 technology 
(New England Biolabs, USA) was performed 
to mutate the asparagine (N)-encoding 
residues of the N-linked glycosylation 
sequence motif N-X-S/T into alanine (A) or 
valine (V) using the primers in Table 3. 
Sequences of the resulting RBDs were 
confirmed by Sanger sequencing (Macrogen, 
The Netherlands). 


Production of recombinant proteins. Human 
embryonic kidney (HEK293T; ATCC CRL- 
3216) cells were transfected with pCD5- 
plasmids using polyethylenimine (PEI) at a 
1:12 ratio. The recombinant proteins were 
purified using Strep-Tactin sepharose beads as 
previously described (11), апа their 
production was confirmed by Western blot 
using Strep-Tactin HRP antibody (IBA, 
Germany). 


Circular Dichroism (CD). Recombinant M41 
and its variants were prepared for CD 
spectroscopy by buffer exchange and 
concentration with four centrifugation cycles 
through 10 kilodalton MWCO Amicon Ultra 
0.5 mL centrifugal filters (UFC 501024) into 
10 mM sodium phosphate, pH 7.75. Final 
concentrations were measured with a Thermo 
Scientific Nanodrop 2000 spectrophotometer. 
CD spectra were collected on a JASCO J-810 
spectropolarimeter with a Peltier thermostated 
fluorescence temperature controller module. 
Samples were diluted to 0.06 mg/mL and 4 
scans accumulated from 285-190 nm with a 
scanning speed of 10 nm/min, Digital 
Integrated Time (D.I.T.) 1 second, bandwidth 
1 nm, and standard sensitivity at 25 °C. A 
thermal melt was done from 25 °C to 95 °C 
with a ramp rate of 1 °C per minute. 
Measurements were taken every 2 degrees at 
222, 218, 215, 212, 208, 205, 196, and 194 
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nm. A full CD scan was collected at 95 °C. 
The temperature was then lowered to 25 °C. 
After allowing the protein to refold for 20 
minutes at 25 °С, a third CD scan was taken at 
25 *C to measure recovery. A Savitzky-Golay 
filter was used to smooth CD data at different 
temperatures for visual comparison (Fig. S-2). 

Secondary structure calculations for 
the CD data collected at 25 °С before the 
thermal melt were processed by Dichroweb 
(16) using the CDSSTR (26), Selcon3 (27), 
and Contill (28) algorithms with protein 
reference set 7. Results from the 3 algorithms 
were averaged and plotted in Fig. 2. 


Protein histochemistry. Histochemistry was 
performed ав previously described (11). 
Briefly, chicken trachea tissues from a seven- 
week-old broiler chicken were sectioned at 4 
um before incubation with RBD proteins at 
100 ug/ml. Desialylated tissues were prepared 
by pre-treatment with 2 mU Neuraminidase 
(Sialidase) from Arthrobacter | ureafaciens 
(AUNA, Sigma, Germany) in 10 mM 
potassium acetate, 2.5 mg/ml TritonX100, pH 
42 at 37 °С overnight before protein 
application. Chicken trachea tissues were from 
a seven-week-old broiler chicken (Gallus 
gallus) obtained from the tissue archive of the 
Veterinary Pathologic Diagnostic Center 
(Department of Pathobiology, Faculty of 
Veterinary Medicine, Utrecht University, The 
Netherlands). 


ELISA. Sialic acids (NeuS5Aca2-3Galf1- 
3GIcNAc-PAA,  3-SiaLc-PAA, GlycoNZ, 
Russia) were coated (1 pg/well) in a 96-well 
maxisorp plate (NUNC, Sigma-Aldrich) at 
4°C overnight, followed by blocking with 3% 
BSA (Sigma) іп РВ5-0, 1%Tween. RBD 
proteins (100 ug/ml) were preincubated with 
Strep-Tactin-HRPO (1:200) for 30 min on ice, 
before applying them to the plates for 2 hours 
at room temperature. 3,3',5,5'- 
tetramethylbenzidine (TMB) substrate was 
used as a peroxidase substrate to visualize 
binding, after which the reaction was 
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terminated using 2N Н2504. Optical densities 
(ODasonm) were measured in а FLUOStar 
Omega (BMG Labtech) microplate reader, and 
MARS Data Analysis Software was used for 
analysis of the data. Protein samples of each 
recombinant protein were measured at each 
concentration in triplicate. Statistical analysis 
was performed by comparing each protein to 
the unmodified RBD using 2way-ANOVA 
with Dunnett’s multiple comparisons test 
where alpha was set to 0.05. 


Glycopeptide preparation, enrichment, and 
N-glycan release. The workflow is shown in 
Supplementary Fig. S-3. Aliquots of between 
200 - 400 ug of M41, N59A and N145A, and 
50 ug of the remaining proteins were digested 
with trypsin as per An & Cipollo (29). 
Approximately 25 - 100 ug aliquots of 
protease digested proteins were processed for 
deglycosylated glycopeptide and 
permethylated glycan analyses. Samples were 
resuspended in 50 mM ammonium 
bicarbonate at pH 8.0. Glycans were released 
by digestion with 10 U/uL PNGase F 
(Glycerol-free from NEB) for 3 hours at 37 
°C. The samples were adjusted to pH 5.0 with 
2-4 uL of 125 mM НСІ. To maximize glycan 
release, samples were further digested with 
0.15 mU/uL PNGase A overnight at 37 °C. 
Free glycans and deglycosylated peptides 
were separated using C18 SPE cartridges 
(Thermo). Intact glycopeptide analyses were 
performed using 175 - 300 pg of НШС 
enriched glycopeptides as per An & Cipollo 
(29) Following data collection on the 
trypsinized glycopeptides, the remainder of 
the M41, N59A, and N145A samples were 
digested with chymotrypsin at a ratio of 1:20 
overnight at 25 °С and HILIC enriched a 
second time (for the M41 and N59A samples 
only) prior to LC/MS analysis. 


Site occupancy. LC/MSF data was collected 
on trypsinized peptides deglycosylated with 
PNGase F as described under М-ојусап 
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release. Asparagines (Asn) that аге 
deglycosylated by PNGase F are converted to 
aspartate (Asp) with a mass gain of 0.984 
daltons due to the replacement of -NH2 with - 
OH. The percent occupancy for each site is 
calculated by comparing the intensity ОГ 
peptides with Asn to those with Asp. 
However, spontaneous deamidation of 
unmodified Asn to Asp can also occur. О 
water, which results in mass shift of 2.984 
daltons, was used to ensure calculated percent 
occupancy was not skewed due to spontaneous 
deamidation. This experiment allows for 
examination of both spontaneous апа 
enzymatically catalyzed deamidation and, 
therefore, accurate estimations of percent 
occupancy of glycosites can be determined. 
Percent occupancy was calculated by 
comparing the intensities of Ше 
deglycosylated (DG) and non-glycosylated 


(NG) peptides using the equation: 
DG/(DG+NG)* 100. 
Purification, permethylation, and semi- 


quantitation of free glycans. PNGase-released 
N-glycans were applied to C18 SPE and eluted 
with 0.1% formic acid leaving Ше 
deglycosylated peptides bound to the C18 
column. The glycan eluate fractions were 
combined, and butanol added to a final 
concentration of 1%. The samples were then 
loaded onto 100 mg porous graphite columns 
(PGC) prepared first by sequential washes of 1 
mL 100% acetonitrile (ACN), 1 mL 60% 
ACN in water, 1 mL 30% ACN in water, and 
1 mL water. All solutions contained 0.1% 
trifluoro acetic acid (TFA). The loaded 
columns were washed three times with 1 mL 
0.1% ТЕА in water, then eluted with 30% 
ACN /0.1% TFA/water, followed by 60% 
ACN/0.1% TFA/water. The eluents were 
pooled and dried in glass vials by rotary 
evaporation. Permethylation was done 
following the method of Cincanu et al. 
(30,31). MALDI-TOF analysis of 
permethylated N-glycans was performed on a 
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Bruker Autoflex™ speed mass spectrometer 
in positive polarity reflectron mode. 2,5- 
Dihydroxybenzoic acid (DHB) was used as a 
matrix and maltooligo-saccharides were used 
as an external calibrant. Data were processed 
using FlexAnalysis™. Each sample was 
spotted three times and scans were collected in 
positive reflectron mode. Peaks were picked, 
assigned, and intensities averaged across each 
set of spots using in-house software. 
Assignments were based on glycans known to 
be present in HEK293T cells. 


Reverse Phase NanoLC/MS£ Analysis of 
Glycopeptides and Peptides. Each peptide or 
glycopeptide sample was analyzed three times. 
A C18 column (ВЕН nanocolumn 100 um i.d. 
x 100 mm, 1.7 um particle, Waters 
Corporation) was used for nanoLC/MS* 
analyses. A Waters nanoAcquity ОРІС 
system was used for automatic sample loading 
and flow control. Load buffer was 3% ACN, 
97% water. Peptides were eluted via a 60- 
minute gradient from 3 to 50% ACN with a 
flow of 0.4 uL/min. All chromatography 
solutions included 0.1% formic acid. The 
eluent flowed to an uncoated 20 um i.d. 
PicoTip Emitter (New Objective Inc., 
Woburn, MA). The mass spectrometer was a 
Waters SYNAPT G2 HDMS system (Waters 
Corp. Milford, MA). Applied source voltage 
was 3000 V. Data was collected in positive 
polarity mode using data independent MS® 
acquisition, which consists of a starting 4V 
scan followed by a scan ramping from 20 to 
50V in 0.9 seconds. To calibrate internally, 
every 30 seconds, 400 fmol/uL 
Glufibrinopeptide B with 1 pmol/ul leucine 
enkephalin in 25% acetonitrile, 0.1% formic 
acid, 74.9% water was injected through the 
lockmass channel at a flow rate of 500 
nL/minute. Initial calibration of the mass 
spectrometer was performed in MS? mode 
using Glufibrinopeptide B and tuned for a 
minimum resolution of 20,000 fwhm. 
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Data Analysis for Peptide and Glycopeptide 
identification. | NanoLC/MS* data were 
processed using BiopharmaLynx 1.3 (Waters) 
and GLYMPS (in-house software) (32,33) to 
identify specific glycans on each peptide. The 
search settings included trypsin digest with up 
to 1 missed cleavage, fixed cysteine 
carbamidomethylation, variable methionine 
oxidation, and variable N-glycan 
modifications based on a building block 
glycan library. Assignment inclusion criteria 
were: (1) the presence of a core fragment 
(peptide, peptide + НехМАс, peptide 
+НехМАс», peptide + dHexiHexNAci, and 
peptide + HexiHexNAc?); (2) the presence of 
three or more peptide fragments; (3) the 
presence of three or more assigned 
glycopeptide fragments; (4) assignment is 
made in at least 2 out of 3 injections, and; (5) 
the existence of the glycan in GlyConnect 
(https://glyconnect.expasy.org). 


Docking. Residues 21 to 268 of the M41 spike 
EM structure were extracted from the 
published structure (PDB code бсу0) (18). 
This corresponds to the M41-RBD used in this 
paper. Glycam-web’s Glycoprotein-builder 
program (34) was used to add the major 
oligosaccharide found at each glycosylation 
site onto the protein in silico. All glycosites in 
the M41 EM structure were occupied except 
N246; however, N246 was occupied in our 
data and was populated accordingly. АП 
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glycosites were glycosylated in the new PDB 
file based on best evidence from our MS data. 
The coordinates of MAI-RBD without 
glycans, MAI-RBD with modeled glycans, 
and bovine RBD (PDB code 4H14) were used 
in docking experiments. A virtual library of 17 
oligosaccharides representing a variety of 
binding epitopes was created based on the 
CFG array v. 4.2 (see Fig. 6 for a list). Raw 
models of the oligosaccharide ligands were 


created with the AMBER tool tleap 
(www.ambermd.org) utilizing the 


GLYCAMO6 force field (35), then energy 
minimized using YASARA (36). Dock 
screening of the library was performed with 
the YASARA implementation of Autodock 
Vina (37) with default parameters. A 
molecular dynamics simulation with explicit 
water (TP3) but with fixed coordinates for the 
backbone atoms was run on the glycosylated 
M41 RBD model to allow the amino acid side 
chains to accommodate the added glycans and 
to find low energy conformations. Two 
models were extracted from the glycosylated 
MD RBD run at 5 and 10 ns, which were used 
for dock screening with the virtual library. 
Each oligosaccharide ligand was docked 
against the structures 20 times. Docking 
results shown in Fig. 6 are for the 10 ns 
model. Results were similar in the 5 ns 
models. 
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Tables 
Table 1. Techniques used in this paper. 
Material Samples Technique Outcome 
Protein! All? CD Secondary structure 
and stability 
Protein All Tissue histochemistry Binding affinity to 
tissues 
Protein MA], М-о-А | ELISA Binding affinity to 
variants sialic acid 
Released glycans All MALDI-TOF MS Percent abundance 
of glycoforms 
Sugar-free peptides M41, М59А, | LC/MS® Site occupancy 
М145А 
Glycopetides M41, М59А, | LC/MS® Assignment of site- 
М145А specific glycoforms 
Protein structure M41 In silico docking Potential binding 
sites 


! Recombinant protein consisting of the first 253 residues of the RBD of the IBV M41 spike 
protein, a GCN4 trimerization motif, and a Strep-tag. 

2 M41 (unmodified), all the N-to-A variants, and two non-glycosylation variants: V57A and 
V58A 


Table 2. Potential glycosylation sites based on sequence, predicted glycosylation sites by 
NetNGlyc (www.cbs.dtu.dk/services/NetNGlyc), and site-occupancy measured by LC/MS. 


Potential nan | 4 
Glycosylation Sites INetNGlyc Predictions Site Occupancy 


Position! Sequence Potential’? |Resut | | M41 


126 INLTV 0.7729 +++ 97.3 + 0.4 98.6 + 0.4 61.4+2.2 


2» мө: bss ү bazo2 jov oo 
246 INTTF 0.4726 52 p40t34 [96.6+0.1 9.42.5 
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Sequence position is based on the mature protein 

The higher the NetNGlyc potential, the more likely it is to be glycosylated 

?This site is not likely to be glycosylated. 

“Average percents and standard deviations are calculated from three separate LC/MS” injections. 
?Where percent occupancy = 100, the intensity of the never-glycosylated peptide was too low to 
detect. 

°Sites missing in the glycosylation variants are noted with NA. Not determined is noted as ND 
ТВоћ glycosylated and non-glycosylated forms were detected but incomplete cleavage and low 
signal intensity precluded accurate approximation of occupancy. 

*Masses matching the spontaneously deaminated and de-glycosylated peptide were not found. 


Table 3. Primers used for site-directed mutagenesis to generate N-to-A and V-to-A substitutions. 
The sequence encoding Alanine is in a lower case. 


Mutant Primer sequence for N>A and V>A substitutions 
N33A FW: CGCTGTGGTGgctATCTCCAGCG 
RV: TAAGCTCCTCCATGCAGG 
У57А FW: GGAGGAAGGgcgGTGAACGCC 
RV: GTGAATTGTGCCCACGATG 
V58A FW: GGAAGGGTGgcgAACGCCTCC 
RV: TCCGTGAATTGTGCCCAC 
N59A FW: AAGGGTGGTGgccGCCTCCAGCA 
RV: CCTCCGTGAATTGTGCCC 
N85A FW: AGCCCACTGT gctTTTAGCGACACC 
RV: GTGCAGAACTGGGAGCTG 
N126A FW: GCTGTTCTACgctCTGACAGTGTCCGTGG 
RV: TGGCCGTTCTTCATGGCG 
N145A FW: GTGCGTGAACgctCTGACCTCCG 
RV: TGGAAGCTCTTAAAGGTTG 
N160A FW: GTATACATCCGCTGAGACCACAGATGTGACCAGC 
RV: ACCAGGTCGCCGTTCAGG 
N194A FW: CTACTTCGTGgctGGCACAGCCCAGGAC 
RV: GCCAGGGCCTTCACCTCC 
N219A FW: CAACACCGGAgctTTCTCCGATGGC 
RV: TACTGACAGGCCAGCAGT 
N229A FW: TCCGTTCATCgccAGCTCCCTGG 
RV: TAAAAGCCATCGGAGAAATTTC 
N246A FW: GAACAGCGTGGCTACCACATTCAC 
RV: TCGCGGTACACAATAAAC 
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Figure Legends 


Figure. 1. Western Blot verifying production of M41 RBD proteins. Recombinant viral 
proteins were produced by transfection of HEK293T cells. The soluble proteins were purified 
from the supernatant using Strep-tactin beads and analyzed by Western blot using a Strep-Tactin 
HRP antibody. 


Figure. 2. Calculated secondary structure for each variant based on CD data. Each bar 
represents the average results from three algorithms in Dichroweb. Standard deviations are 
indicated at the top of each color. From the bottom, the bar segments represent a-helix (blue), p- 
strand (red), turns (yellow), and unordered (green). 


Figure. 3. Tissue and ELISA binding assays. Histochemical assays of recombinant unmodified 
M41-RBD and single N-to-A and V-to-A glycosylation variants to (A) trachea tissue and (B, C) 
ELISA-presented Neu5Aca2-3Galp1-3GlcNAc. (B) Concentration dependence of binding. (С) 
Absorbance for each protein at the 75 nmol concentration. Two-way ANOVA showed 
significantly less binding by variant N33A, N59A, N85A, N126A, N160A and N194A КВР 
proteins compared to unmodified RBD (compare light grey bars (variant) to unmodified (black 
bar)). No significant difference was observed for variants with dark grey bars. Data points are 
averaged from three separate assays. 


Figure. 4. Free glycans identified by MALDI-ToF analysis. Data for M41 and variants are 
arranged in columns. Assigned glycans are on the Y-axis. Blue bars represent the average 
percent abundance across three measurements. Standard deviation is indicated with black lines 
on top of the bars. Glycan compositions are arranged by increasing complexity, starting with 
high mannose (i.e. Hex5HexNAc2) at the top and ending with the larger complex forms at the 
bottom. Yellow and white shading groups glycan compositions with increasing numbers of 
HexNAcs moving from top to bottom. Abbreviations are hexose (Hex), N-acetyl glucosamine 
(HexNAc), deoxyhexose (dHex), sialic acid (NeuAc). 


Figure. 5. Site-specific glycosylation of M41, N59A, and N145A. The S1-N terminal receptor 
binding domain residues 21-268 from PDB entry 6cvO is represented in gray ribbons. The 
asparagines of glycosylation sites which could still bind trachea tissue after mutation to alanine 
are in cyan, those that could not are in dark red. N-acetyl-glucosamine residues from the 
structure are dark blue balls and sticks. The most predominant glycan for each site across all 
three constructs is shown to the right. Glycoforms shown on the right are based on our data and 
inferred structural detail based on accepted knowledge of the cell type used in protein 
production. Monosaccharides are represented as follows: Mannose (green circles), galactose 
(yellow circles), N-acetyl-glucosamine (blue squares), fucose (red triangles), and sialic acid 
(purple diamonds). Numbering of the sites is based on the mature sequence. The figure was 
made with CCPAMG (38) and GIMP (www.gimp.org). 


Figure. 6. Docking results. Top: List of all oligos docked to the M41-RBD. Columns with ‘x’ 
indicate the sugar in that row docked 7 or more times out of 20 at the indicated site on the 
protein. Array scores are from Wickramasinghe et al., 2011 (11), referenced in the figure as 
footnote 1. White columns were against structure without sugars, gray columns were with 
LC/MS-identified sugars modeled on. Bottom: RBD-binding domain of M41 Кот PDB structure 
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Avian coronavirus glycosylation 


бсу0. Glycosylation sites are shown as cyan balls. Sites where two or more oligosaccharides 
docked seven or more times are indicated as colored, space-filled amino acids. Colors and labels 
match the table above. Panel (B) is (A) turned 90° toward the user. Structure representations 
made іп  CCPA-MG (38) Sugar symbols rendered with DrawGlycan-SNFG 
(www. virtualglycome.org/DrawGlycan/). 


Figure. 7. Charge distribution looking down on the potential sialic acid binding site of M41 
(A) and bovine (B) RBDs. Orientation of both proteins matches that of Figure 6B. Positive 
electrostatic charge is blue, negative is red. Sugars are gray boxes on (A) and pink boxes on (B). 
Y162, E182, W184, and H185 in (B) are involved in binding to sialic acid. Large * in (A) 
indicates possible binding site based on structural comparison between the two proteins. Images 
made with CCP4-MG (38). Bovine coordinates from PDB code 4H14. 
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Figure. 1. Western Blot verifying production of M41 RBD proteins. 
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Figure. 2. Calculated secondary structure for each mutant based on CD data. 
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Figure. 3. Tissue and ELISA binding assays. 
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Figure. 4. Free glycans identified by MALDI-TOF analysis. 


ТРИ 
VEEN 
VZSA 
8SA 
V6SN 
VS8N 
9cTN 
SvTN 
O9TN 
v6TN 
6TZN 
6ccN 
9vcN 


Hex3HexNAc2 
Hex4HexNAc2 
Hex5HexNAc2 
Hex6HexNAc2 
Hex7HexNAc2 
Hex8HexNAc2 
Hex9HexNAc2 
Hex3HexNAc3 
dHex1Hex3HexNAc3 
Hex4HexNAc3 
dHexlHex4HexNAc3 
NeuAclHex4HexNAc3 
NeuAcldHexlHex4HexNAc3 
Hex5HexNAc3 
NeuAclHexSHexNAc3 
NeuAc2HexSHexNAc3 
dHexlHex5HexNAc3 
NeuAcldHexlHex5HexNAc3 
Hex6HexNAc3 
NeuAclHex6HexNAc3 
dHex1Hex6HexNAc3 
NeuAcldHexlHex6HexNAc3 
Hex3HexNAc4 
dHexlHex3HexNAc4 
Hex4HexNAc4 
dHexlHex4HexNAc4 
dHex2Hex4HexNAc4 
NeuAclHex4HexNAc4 
NeuAcldHexlHex4HexNAc4 
Hex5HexNAc4 
NeuAclHexSHexNAc4 
NeuAc2Hex5HexNAc4 
dHexlHex5HexNAc4 
NeuAcldHexlHexSHexNAc4 III 
NeuAc2dHexlHex5HexNAc4 
dHex2HexSHexNAc4 
Hex6HexNAc4 
dHex1Hex6HexNAc4 
NeuAclHex6HexNAc4 
NeuAc2Hex6HexNAc4 
NeuAc3Hex6HexNAc4 
NeuAcldHexlHex6HexNAc4 
Hex7HexNAc4 
NeuAclHex7HexNAc4 
NeuAc2Hex7HexNAc4 
НехЗНехМАс5 
dHexlHex3HexNAc5 
Hex4HexNAc5 
dHexlHex4HexNAc5 
NeuAcldHexlHex4HexNAc5 
NeuAcldHex2Hex4HexNAc5 
dHex2Hex4HexNAc5 
NeuAclHex4HexNAc5 
Нех5НехМАс5 
dHexlHexSHexNAc5 
NeuAcldHexlHex5HexNAc5 
NeuAc2dHex1Hex5HexNAc5 IV 
Hex6HexNAc5 
NeuAclHex6HexNAc5 
NeuAc2Hex6HexNAc5 
NeuAc3Hex6HexNAc5 
dHexlHex6HexNAc5 
NeuAcldHexlHex6HexNAc5 
NeuAc2dHexlHex6HexNAcS 
NeuAc3dHexlHex6HexNAc5 
dHex2Hex6HexNAcS 
NeuAcldHex2Hex6HexNAc5 
Hex7HexNAc5 
NeuAclHex7HexNAc5 
NeuAc2Hex7HexNAc5 
NeuAc3Hex7HexNAc5 
NeuAcldHexlHex7HexNAc5 
dHex1Hex3HexNAc6 
dHex2Hex3HexNAc6 
dHexlHex4HexNAc6 
NeuAcldHexlHex4HexNAc6 
HexSHexNAc6 
NeuAclHex5HexNAc6 
dHexlHexSHexNAc6 
dHex2Hex5HexNAc6 
NeuAcldHexlHex5HexNAc6 
NeuAc2dHex1lHexSHexNAc6 
Hex6HexNAc6 
NeuAclHex6HexNAc6 
NeuAc2Hex6HexNAc6 
NeuAcldHexlHex6HexNAc6 
NeuAc2dHexlHex6HexNAc6 
NeuAc3dHexlHex6HexNAc6 
NeuAclHex7HexNAc6 
NeuAc2Hex7HexNAc6 
dHexlHex7HexNAc6 
NeuAcldHexlHex7HexNAc6 
NeuAc2dHexlHex7HexNAc6 
NeuAc3dHexlHex7HexNAc6 
dHex2Hex7HexNAc6 
NeuAcldHex2Hex7HexNAc6 
NeuAc2Hex8HexNAc6 
dHexlHex4HexNAc7 
dHexlHex6HexNAc7 
dHex1Hex7HexNAc7 
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Figure. 5. Site-specific glycosylation of M41, N59A, and N145A 
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Figure. 6. Docking results. 


IUPAC-condensed nomenclature 


Array score! 


Bas GlcNAc(b1-3)Gal(b1-4)GIcNAc(b-OMe) 28 £35 
ШИЕ GlcNAc(b1-4)Gal(b1-4)GIcNAc(b-OMe) 24+9 
| | |х| | KDN(a2-3)Gal(b1-3)GlcNAc(b-OMe) NA 
ЕН KDN(a2-3)Gal(b1-4)GlcNAc(b-OMe) 0%3 
sass Neu5Ac(a2-3)Gal(b1-3)GlcNAc(a-OMe) 16 +17 
Е Neu5Ac(a2-3)[Neu5Ac(a2-6)]GalNAc(a-OMe) 33414 
| x | Neu5Ac(a2-3)GalNAc(a-OMe) 12 #2 
үй Neu5Ac(a2-3)Gal(b1-3)[Fuc(a1-4)]GlcNAc(b-OMe) 363 + 20 
ІНЕ Neu5Ac(a2-3)Gal(b1-3)[Neu5Ac(a2-3)Gal(b1-4)]GlcNAc(b-OMe) | 3308 + 257 
MMA Neu5Ac(a2-3)Gal(b-OMe) 32 +24 
EAS Neu5Ac(a2-3)Gal(b1-3)GIcNAc(b-OMe) 285 + 101 
Neu5Ac(a2-3)Gal(b1-4)[Fuc(a1-3)]GlcNAc(b1-3)Gal(b-OMe) 5%7 
Neu5Ac(a2-3)Gal(b1-4)GIcNAc(b-OMe) 6+5 


Neu5Ac(a2-3)Gal(b1-4)Glc(b-OMe) 


Neu5Ac(a2-6)Gal(b1-4)GIcNAc(b-OMe) 15 + 14 
Neu5Ac(a2-6)Gal(b1-4)Glc(b-OMe) 4+3 
Neu5Ac(a2-8)Neu5Ac(a2-8)Neu5Ac(b-OMe) 11+6 
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Figure. 7. Charge distribution looking down on the potential sialic acid binding site of M41 
(A) and bovine (B) RBDs. 


Ү162,Е182, 
W184,H185 
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